klotz: retrieval-augmented generation* + llm*

0 bookmark(s) - Sort by: Date ↓ / Title / - Bookmarks from other users for this tag

  1. The article explains semantic text chunking, a technique for automatically grouping similar pieces of text to be used in pre-processing stages for Retrieval Augmented Generation (RAG) or similar applications. It uses visualizations to understand the chunking process and explores extensions involving clustering and LLM-powered labeling.
  2. Archyve is a web app that enhances pretrained language models with user's documents, while keeping them on the user's own devices and infrastructure. It provides an API for querying documents, an LLM chat UI, and an API for third-party LLM chat UIs.
    2024-08-30 Tags: , , by klotz
  3. This article discusses the importance of chunking, embedding, and indexing in RAGs (Recursive Auto-Segmented Graphs). The author compares recursive character splitting and semantic splitting techniques for text chunking and suggests the use of agentic chunking for superior RAG retrieval.
    2024-08-27 Tags: , , , by klotz
  4. This article explores the limitations of position-based chunking in Retrieval Augmented Generation (RAG) systems and proposes semantic chunking as a better alternative for improved performance.
    2024-08-24 Tags: , , , by klotz
  5. This article explores how to incorporate images into a RAG (Retrieval-Augmented Generation) knowledgebase using Large Language Models (LLMs) with vision capabilities. It provides a step-by-step guide to collecting, uploading, and transcribing images for a richer and more detailed knowledgebase.
  6. This article explores the challenges and considerations of implementing Retrieval Augmented Generation (RAG) systems for real-world business applications, beyond simple demos. It covers data handling, performance optimization, and the importance of aligning RAG with specific business goals.
    2024-08-30 Tags: , , by klotz
  7. This article introduces Graph RAG, a method for enhancing Language Model (LLM) applications by incorporating knowledge graphs. It explains the limitations of traditional text embedding-based retrieval and how Graph RAG addresses them by providing a global understanding of the knowledge base through community detection and report generation.
    2024-08-23 Tags: , , , by klotz
  8. A list of 13 open-source software for building and managing production-ready AI applications. The tools cover various aspects of AI development, including LLM tool integration, vector databases, RAG pipelines, model training and deployment, LLM routing, data pipelines, AI agent monitoring, LLM observability, and AI app development.
    1. Composio - Seamless integration of tools with LLMs.
    2. Weaviate - AI-native vector database for AI apps.
    3. Haystack - Framework for building efficient RAG pipelines.
    4. LitGPT - Pretrain, fine-tune, and deploy models at scale.
    5. DsPy - Framework for programming LLMs.
    6. Portkey's Gateway - Reliably route to 200+ LLMs with one API.
    7. AirByte - Reliable and extensible open-source data pipeline.
    8. AgentOps - Agents observability and monitoring.
    9. ArizeAI's Phoenix - LLM observability and evaluation.
    10. vLLM - Easy, fast, and cheap LLM serving for everyone.
    11. Vercel AI SDK - Easily build AI-powered products.
    12. LangGraph - Build language agents as graphs.
    13. Taipy - Build AI apps in Python.
  9. This article explores how to implement a retriever over a knowledge graph containing structured information to power RAG (Retrieval-Augmented Generation) applications.
  10. This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: Tags: retrieval-augmented generation + llm

About - Propulsed by SemanticScuttle